Measuring Reproducibility of High-Throughput Biological Experiments
Abstract
Reproducibility is essential to reliable scientific discovery in large-scale, high-throughput biological studies. In this talk, I will present a unified approach for measuring the reproducibility of findings identified from replicate experiments and for selecting discoveries based on reproducibility between replicates. Unlike the usual scalar measures of reproducibility, our approach views reproducibility in terms of the point at which findings are no longer consistent across replicates. To measure the pairwise consistency between replicates, we develop a graphical statistic based on empirical copulas and a copula mixture model that quantitatively describes how consistency changes as the significance of findings decreases. Based on the copula mixture procedure, we define a quantity called the "irreproducible discovery rate", in a fashion analogous to the false discovery rate. This quantity, which describes the lack of reproducibility among the identifications selected at each threshold, provides a reproducibility criterion for selecting reliable signals and for assessing the overall reproducibility of findings. Our approach can be applied to both probabilistic and heuristic-based significance scores, and it permits principled setting of selection thresholds. The method has been adopted by the ENCODE consortium for selecting ChIP-seq signal identification algorithms and for monitoring the performance of their experimental facility. I will illustrate the effectiveness of our method using some ENCODE examples.
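The analogy with the false discovery rate can be illustrated with a minimal sketch. It assumes that a fitted copula mixture model has already produced, for each finding, a posterior probability of belonging to the irreproducible component (called `local_idr` here, a hypothetical name), and it assumes the rate at a threshold is estimated as the average of those probabilities over the selected findings, in the same way a false discovery rate can be obtained from local false discovery rates. The function names and the cutoff `alpha` are illustrative and are not taken from the abstract; fitting the mixture model itself is not shown.

```python
import numpy as np

def irreproducible_discovery_rate(local_idr):
    """For each finding, estimate the irreproducible discovery rate of the set
    containing it and every finding with a smaller local_idr.

    local_idr: assumed posterior probabilities (one per finding) of belonging
    to the irreproducible mixture component. Hypothetical input; the mixture
    fit that would produce these values is outside this sketch.
    """
    local_idr = np.asarray(local_idr, dtype=float)
    # Rank findings from most to least reproducible.
    order = np.argsort(local_idr, kind="stable")
    # Running mean of local_idr over the top-k findings, for k = 1..n.
    running_mean = np.cumsum(local_idr[order]) / np.arange(1, local_idr.size + 1)
    # Map the running means back to the original order of the findings.
    idr = np.empty_like(local_idr)
    idr[order] = running_mean
    return idr

def select_reproducible(local_idr, alpha=0.05):
    """Boolean mask of findings whose estimated rate is at most alpha."""
    return irreproducible_discovery_rate(local_idr) <= alpha

if __name__ == "__main__":
    example = [0.01, 0.5, 0.03, 0.9]  # made-up values for illustration only
    print(irreproducible_discovery_rate(example))
    print(select_reproducible(example, alpha=0.05))
```

Because the findings are sorted by increasing `local_idr`, the running mean never decreases, so thresholding it at `alpha` selects a top-ranked prefix of the findings.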
Similar resources
Measuring Reproducibility of High-Throughput Deep-Sequencing Experiments Based on Self-adaptive Mixture Copula
Measuring the statistical reproducibility between biological experiment replicates is a vital first step in the entire series of bioinformatics analyses for mining meaningful biological discoveries from mega-data. To distinguish the real, biologically relevant signals from artificial signals, irreproducible discovery rate (IDR) employing Copula, which can separate dependence structure and margina...
A novel medium-throughput biological assay system for HTLV-1 infectivity and drug discovery
Objective(s): Here, a reporter cell line containing two reporter vectors was developed in order to monitor Human T-Lymphotropic Virus type 1 (HTLV-1) infectivity and cell viability simultaneously. Materials and Methods: The reporter cell line was constructed by stably transfecting the baby hamster kidney cell line (BHK-21) with the genomes expressing two different reporters in separate pl...
Rapid and high throughput regeneration in fennel (Foeniculum vulgare Mill.) from embryo explants
Callus induction and regeneration of fennel from embryo explants were stabilized in the presence of the antibiotic cefotaxime and different plant growth regulators (PGRs). The experiments were conducted as a factorial experiment based on a completely randomized design (CRD). The genotypes Fasa, Meshkinshar and Hajiabad were tested under different concentrations of cefotaxime (0 and 100 mg l-1...
Prediction, expansion, and visualization of biological pathways and networks using perturbation data and cyclical graphical models
Cellular processes are the interaction of multiple proteins, genomic sites, RNAs, small molecules, and their complexes. The set of these interactions and their contexts provide biological understanding of functionality beyond single-gene annotation. Biological networks have emerged as the dominant method of communicating, modeling, and understanding cellular processes and pathways. Computationa...
Computational Tools for a Novel Transcriptional Profiling Method
In this thesis we provide computational tools for the planning of VTT-TRAC experiments. VTT-TRAC is a novel method for measuring expression levels of genes. Monitoring gene expression by measuring the amounts of transcribed mRNAs (transcriptional profiling) has become an important experimental method in molecular biology. This has been due to rapid advances in the high-throughput measurement tech...
Publication date: 2011